backward Fukui function → quantum-chemical descriptors (⊙ Fukui function)
Balaban centric indices → centric indices

Balaban distance connectivity index → Balaban distance connectivity indices
Balaban distance connectivity indices

The first proposed and most important of this series of topological indices is
the Balaban distance connectivity index J (also called distance connectivity index or
average distance sum connectivity). It is one of the most discriminating → molecular
descriptors and its values do not increase substantially with molecule size or number
of rings; it is defined in terms of sums over each ith row of the distance matrix D, i.e.
the vertex distance degree σ [Balaban, 1982; Balaban, 1983a]. It is defined as:

$$ J = \frac{B}{C+1} \cdot \sum_b \left( \sigma_i \cdot \sigma_j \right)_b^{-1/2} $$

where $\sigma_i$ and $\sigma_j$ are the vertex distance degrees of two adjacent atoms, and the sum
runs over all the molecular bonds b; B is the number of bonds in the molecular graph
G, and C is the → cyclomatic number, i.e. the number of rings. The denominator
C + 1 is a normalization factor against the number of rings in the molecule. $\sigma_i / B$
is the average vertex distance degree; it was observed that within an isomeric series
the average distance degrees are low in the more branched isomers.
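The definition above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: it assumes a connected, hydrogen-depleted graph given as an adjacency dictionary, obtains the distance degrees by breadth-first search, and takes the cyclomatic number as C = B - A + 1. The function names and the test graph (the carbon skeleton of 2-methylpentane) are our own choices.

```python
from collections import deque


def distance_sums(adjacency):
    """Vertex distance degrees: row sums of the topological distance matrix."""
    sums = {}
    for source in adjacency:
        dist = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search from `source`
            v = queue.popleft()
            for w in adjacency[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        sums[source] = sum(dist.values())
    return sums


def balaban_j(adjacency):
    """J = B / (C + 1) * sum over bonds of (sigma_i * sigma_j) ** -1/2."""
    sigma = distance_sums(adjacency)
    bonds = {frozenset((v, w)) for v in adjacency for w in adjacency[v]}
    n_bonds = len(bonds)
    cyclomatic = n_bonds - len(adjacency) + 1  # assumes a connected graph
    total = sum((sigma[v] * sigma[w]) ** -0.5 for v, w in map(tuple, bonds))
    return n_bonds / (cyclomatic + 1) * total


# Carbon skeleton of 2-methylpentane (atom 6 is the methyl branch on atom 2)
two_methylpentane = {1: [2], 2: [1, 3, 6], 3: [2, 4], 4: [3, 5], 5: [4], 6: [2]}
print(round(balaban_j(two_methylpentane), 4))  # 2.6272
```

For this skeleton the distance degrees are 12, 8, 8, 10, 14 and 12, and with five bonds and no rings the prefactor B/(C + 1) is 5.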

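To further improve the discriminant power of the Balaban index J, a set of new
LOVIs (local vertex invariants) was defined as:

$$ t_i = \frac{\sigma_i}{\delta_i} $$

where $\delta_i$ is the ith vertex degree. Therefore, the $J_t$ index was defined as:

$$ J_t = \frac{B}{C+1} \cdot \sum_b \left( t_i \cdot t_j \right)_b^{-1/2} $$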

The idea behind these LOVIs is that usually the vertices with the biggest distance
sums have the lowest vertex degrees, thus enhancing the intramolecular differences
[Balaban, 1994a].
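As a hedged sketch of this variant, the code below assumes that each local invariant is the distance degree divided by the vertex degree, t_i = σ_i / δ_i, and that the t values replace the distance degrees in the bond sum. The helper names and the test graph (the 2-methylpentane skeleton) are illustrative assumptions.

```python
from collections import deque


def distance_sum(adjacency, source):
    """Sum of bond-count distances from `source` to every vertex."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for w in adjacency[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return sum(dist.values())


def balaban_jt(adjacency):
    """J_t = B / (C + 1) * sum over bonds of (t_i * t_j) ** -1/2."""
    # Assumed LOVI: distance degree divided by vertex degree
    t = {v: distance_sum(adjacency, v) / len(adjacency[v]) for v in adjacency}
    bonds = {frozenset((v, w)) for v in adjacency for w in adjacency[v]}
    n_bonds = len(bonds)
    cyclomatic = n_bonds - len(adjacency) + 1  # assumes a connected graph
    total = sum((t[v] * t[w]) ** -0.5 for v, w in map(tuple, bonds))
    return n_bonds / (cyclomatic + 1) * total


two_methylpentane = {1: [2], 2: [1, 3, 6], 3: [2, 4], 4: [3, 5], 5: [4], 6: [2]}
print(round(balaban_jt(two_methylpentane), 4))  # 5.0143
```

Working with unrounded t values gives 5.0143; rounding the branched atom's t to 2.667 before summing gives 5.0141.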

The J index for multigraphs is calculated by the distance sums of the → multigraph
distance matrix where the distances are obtained by weighting each edge with the
inverse of its conventional bond order (→ relative topological distance); the sum runs
over all pairs of adjacent vertices and B is the number of edges in the graph without
accounting for their multiplicity.
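A minimal sketch of the multigraph variant follows, under the assumptions that each bond contributes a length of 1/(bond order), that shortest weighted paths are found with the Floyd-Warshall algorithm, and that C = B - A + 1 with B counting each bond once. The function name and the propene example are hypothetical illustrations.

```python
import math


def multigraph_balaban_j(n_atoms, bonds):
    """bonds: list of (i, j, bond_order) tuples with 0-based atom indices."""
    d = [[0.0 if i == j else math.inf for j in range(n_atoms)]
         for i in range(n_atoms)]
    for i, j, order in bonds:
        d[i][j] = d[j][i] = 1.0 / order  # edge weight = inverse bond order
    for k in range(n_atoms):  # Floyd-Warshall shortest paths
        for i in range(n_atoms):
            for j in range(n_atoms):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    sigma = [sum(row) for row in d]  # multigraph distance degrees
    n_bonds = len(bonds)  # multiplicity is not counted
    cyclomatic = n_bonds - n_atoms + 1  # assumes a connected graph
    total = sum((sigma[i] * sigma[j]) ** -0.5 for i, j, _ in bonds)
    return n_bonds / (cyclomatic + 1) * total


# Propene skeleton C=C-C: one double bond, one single bond
propene = [(0, 1, 2), (1, 2, 1)]
print(round(multigraph_balaban_j(3, propene), 4))  # 2.1875
```

Here the distance degrees are 2.0, 1.5 and 2.5, so the double bond shortens every path that passes through it.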
$$ J = 5 \cdot \left[ (12 \cdot 8)^{-1/2} + (12 \cdot 8)^{-1/2} + (8 \cdot 8)^{-1/2} + (8 \cdot 10)^{-1/2} + (10 \cdot 14)^{-1/2} \right] = 2.6272 $$

$$ J_t = 5 \cdot \left[ (12 \cdot 2.667)^{-1/2} + (12 \cdot 2.667)^{-1/2} + (2.667 \cdot 4)^{-1/2} + (4 \cdot 5)^{-1/2} + (5 \cdot 14)^{-1/2} \right] = 5.0141 $$

Box B-1.

In order to account for both bond multiplicity and heteroatoms, Balaban modified
distance connectivity indices $J^X$ and $J^Y$ were proposed [Balaban, 1986a; Balaban et al.,
1990a]. These are defined in the same way as the Balaban distance connectivity index
but derived from the → multigraph distance matrix *D instead of the original distance
matrix D:

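$$ J^X = \frac{B}{C+1} \cdot \sum_b \left( {}^*\sigma_i^X \cdot {}^*\sigma_j^X \right)_b^{-1/2} \qquad J^Y = \frac{B}{C+1} \cdot \sum_b \left( {}^*\sigma_i^Y \cdot {}^*\sigma_j^Y \right)_b^{-1/2} $$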

where B is the bond number, C is the cyclomatic number, and the sum runs over all
bonds b in the graph, each being weighted by the inverse square root of the product of
the → multigraph distance degrees of the incident vertices. The distance degrees are
calculated as:



$$ {}^*\sigma_i^X = X_i \cdot {}^*\sigma_i \quad \text{and} \quad X_i = 0.4196 - 0.0078 \cdot Z_i + 0.1567 \cdot L_i $$

$$ {}^*\sigma_i^Y = Y_i \cdot {}^*\sigma_i \quad \text{and} \quad Y_i = 1.1191 + 0.0160 \cdot Z_i - 0.0537 \cdot L_i $$



where ${}^*\sigma_i$ is the vertex distance degree calculated on the multigraph distance matrix *D,
the quantities X and Y are recalculated atomic Sanderson electronegativities and
covalent radii relative to the carbon atom, obtained as a function of the atomic number $Z_i$
and the periodic-table group number $L_i$ of the atom (for carbon, $L = 4$ gives
$X \approx Y \approx 1$); for atoms different from B, C, N, O, F, Si, P, S, Cl, As, Se, Br, Te,
and I the X and Y values are set at one. The $J^X$ and $J^Y$ indices account for the
presence of heteroatoms in the molecule.
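The two linear expressions for X and Y are easy to check numerically. The sketch below is illustrative only: it assumes L is the periodic-table group number (4 for carbon, 6 for oxygen), which is the choice that makes both quantities come out close to one for carbon, and it assumes the parametrized elements are B, C, N, O, F, Si, P, S, Cl, As, Se, Br, Te and I. The function and constant names are our own.

```python
# Atomic numbers of B, C, N, O, F, Si, P, S, Cl, As, Se, Br, Te, I
# (assumed reading of the element list; all other atoms get 1.0)
PARAMETRIZED = {5, 6, 7, 8, 9, 14, 15, 16, 17, 33, 34, 35, 52, 53}


def relative_electronegativity(z, group):
    """X = 0.4196 - 0.0078 * Z + 0.1567 * L for parametrized atoms, else 1."""
    if z not in PARAMETRIZED:
        return 1.0
    return 0.4196 - 0.0078 * z + 0.1567 * group


def relative_covalent_radius(z, group):
    """Y = 1.1191 + 0.0160 * Z - 0.0537 * L for parametrized atoms, else 1."""
    if z not in PARAMETRIZED:
        return 1.0
    return 1.1191 + 0.0160 * z - 0.0537 * group


# Carbon (Z = 6, group 4) is the reference atom, so both values are about 1
print(round(relative_electronegativity(6, 4), 4))  # 0.9996
print(round(relative_covalent_radius(6, 4), 4))    # 1.0003
# Oxygen (Z = 8, group 6): more electronegative and smaller than carbon
print(round(relative_electronegativity(8, 6), 4))  # 1.2974
print(round(relative_covalent_radius(8, 6), 4))    # 0.9249
```

Multiplying a vertex's multigraph distance degree by X or Y then gives the weighted distance degree used in the bond sum.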

Another generalization of the Balaban index J, so as to account for heteroatoms in
the molecule, is the Barysz index calculated on the → Barysz distance matrix.

JJ indices derived from the Wiener matrix were proposed as a generalization of
the Balaban index in analogy with the Kier-Hall connectivity indices.

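The 3D-Balaban index ${}^{3D}J$ was derived from the → geometry matrix G as:

$$ {}^{3D}J = IB(\mathbf{G}) = \frac{B}{C+1} \cdot \sum_b \left( {}^G\sigma_i \cdot {}^G\sigma_j \right)_b^{-1/2} $$

where IB is the → Ivanciuc-Balaban operator, and ${}^G\sigma_i$ and ${}^G\sigma_j$ are the → geometric distance
degrees of the two vertices incident with the bond b [Mihalic et al., 1992a].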
A Balaban-type index DJ [Balaban and Diudea, 1993] was defined as:



where A is the atom number, f is the multigraph factor, w is a weighting factor
accounting for heteroatoms, and the inner sum runs over all vertices j at distance 1
from the ith atom, i.e. vertices bonded to the ith atom; $d_i$ are local vertex invariants
accounting for heteroatoms and bond multiplicity. When the factor w is equal to one
and the multigraph factor is equal to zero then the index DJ is related to the Balaban
index J by the following:

[Balaban and Quintas, 1983] [Barysz et al., 1983a] [Balaban and Filip, 1984]
[Balaban et al., 1985e] [Sabljic, 1985] [Mekenyan et al., 1987] [Balaban and Ivanciuc, 1989]
[Balaban et al., 1990b] [Balaban et al., 1992a] [Nikolic et al., 1993a] [Guo and Randic,
1999]

Balaban ID number → ID numbers

Balaban modified distance connectivity indices → Balaban distance connectivity
indices

Balaban-type index → Balaban distance connectivity indices

Bartell resonance energy → resonance indices

barycentre ; centre of mass → centre of a molecule

Barysz index → weighted matrices

Barysz distance matrix → weighted matrices
Kekulé structure count ; Kekulé number

Kellog and Abraham interaction field → molecular interaction fields
(⊙ hydrophobic fields)

Kier alpha-modified shape descriptors → Kier shape descriptors
Kier bond rigidity index → flexibility indices

Kier-Hall connectivity indices ; connectivity indices of mth order → connectivity indices

Kier-Hall connectivity matrix → weighted matrices

Kier-Hall electronegativity → vertex degree

Kier-Hall solvent polarity index → electric polarization descriptors

Kier molecular flexibility index → flexibility indices

Kier steric descriptor → steric descriptors

Kier shape descriptors (κ)

Topological shape descriptors ${}^m\kappa$ defined in terms of the number of graph vertices A
and the number of paths ${}^mP$ with length m (m = 1, 2, 3) in the H-depleted molecular
graph, according to the following:



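$$ {}^1\kappa = 2 \cdot \frac{{}^1P_{\max} \cdot {}^1P_{\min}}{({}^1P_i)^2} = \frac{A (A-1)^2}{({}^1P_i)^2} $$

$$ {}^2\kappa = 2 \cdot \frac{{}^2P_{\max} \cdot {}^2P_{\min}}{({}^2P_i)^2} = \frac{(A-1)(A-2)^2}{({}^2P_i)^2} $$

$$ {}^3\kappa = 4 \cdot \frac{{}^3P_{\max} \cdot {}^3P_{\min}}{({}^3P_i)^2} =
\begin{cases}
\dfrac{(A-3)(A-2)^2}{({}^3P_i)^2} & \text{for even } A \ (A > 3) \\[2ex]
\dfrac{(A-1)(A-3)^2}{({}^3P_i)^2} & \text{for odd } A \ (A > 3)
\end{cases} $$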



where ${}^mP_{\min}$ and ${}^mP_{\max}$ are the minimum and maximum mth order path counts in
the molecular graphs of molecules with the same → atom number A [Kier, 1985; Kier,
1986b]. These extremes are obtained from two reference structures chosen in an isomeric
series and, for the ith molecule, the path count therefore satisfies:

$$ {}^mP_{\min} \le {}^mP_i \le {}^mP_{\max} $$

The reference structure for ${}^1P_{\min}$ is the linear graph while for ${}^1P_{\max}$ it is the
→ complete graph in which all atoms are bonded to each other; their numerical values
are calculated as follows:

$$ {}^1P_{\min} = A - 1 \qquad {}^1P_{\max} = \frac{A (A-1)}{2} $$

The scaling factor of 2 in the numerator of the index formula makes the value
${}^1\kappa = A$ when there are no cycles in the graph of the molecule. Monocyclic molecules
have a lower value and bicyclic structures have an even lower value. The structural
information encoded in ${}^1\kappa$ is related to the complexity, or more precisely, the number
of cycles of a molecule.

The reference structure for ${}^2P_{\min}$ is the linear graph, while for ${}^2P_{\max}$ it is the → star
graph, in which all atoms but one are adjacent to a central atom; their numerical values
are calculated as follows:

$$ {}^2P_{\min} = A - 2 \qquad {}^2P_{\max} = \frac{(A-1)(A-2)}{2} $$



where A is the total number of vertices in the graph. The scaling factor of 2 in the
numerator of the index formula makes the value ${}^2\kappa = A - 1$ for all linear graphs. The
information encoded by the ${}^2\kappa$ index is related to the degree of star graph-likeness and
linear graph-likeness, i.e. ${}^2\kappa$ encodes information about the spatial density of atoms in
a molecule.

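The reference structure for ${}^3P_{\min}$ is the linear graph while for ${}^3P_{\max}$ it is the
twin star graph; their numerical values are calculated as follows:

$$ {}^3P_{\min} = A - 3 \qquad {}^3P_{\max} =
\begin{cases}
\dfrac{(A-2)^2}{4} & \text{for even } A \\[2ex]
\dfrac{(A-1)(A-3)}{4} & \text{for odd } A
\end{cases} $$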



The scaling factor of 4 is used in the numerator of the ${}^3\kappa$ index to bring ${}^3\kappa$ onto
approximately the same numerical scale as the other kappa indices. The ${}^3\kappa$ values are larger
when molecular branching is nonexistent or when it is located at the extremities of
a graph; ${}^3\kappa$ encodes information about the centrality of branching.
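The three kappa indices can be computed directly from path counts. The sketch below is an illustration under stated assumptions: it uses the closed forms A(A-1)²/(¹P)², (A-1)(A-2)²/(²P)², and (A-3)(A-2)²/(³P)² or (A-1)(A-3)²/(³P)² for even or odd A, and it counts paths by brute-force enumeration, which is adequate only for small graphs. The function names and the 2-methylpentane test skeleton are our own choices.

```python
def count_paths(adjacency, length):
    """Count simple paths made of `length` bonds in an undirected graph."""
    found = 0

    def extend(path):
        nonlocal found
        if len(path) == length + 1:
            found += 1
            return
        for w in adjacency[path[-1]]:
            if w not in path:
                extend(path + [w])

    for v in adjacency:
        extend([v])
    return found // 2  # every path is walked once from each end


def kappa(adjacency, m):
    """Kier shape index of order m (1, 2 or 3); fails if the path count is 0."""
    a = len(adjacency)
    p = count_paths(adjacency, m)
    if m == 1:
        return a * (a - 1) ** 2 / p ** 2
    if m == 2:
        return (a - 1) * (a - 2) ** 2 / p ** 2
    if m == 3:
        if a % 2 == 0:
            return (a - 3) * (a - 2) ** 2 / p ** 2
        return (a - 1) * (a - 3) ** 2 / p ** 2
    raise ValueError("m must be 1, 2 or 3")


two_methylpentane = {1: [2], 2: [1, 3, 6], 3: [2, 4], 4: [3, 5], 5: [4], 6: [2]}
print([count_paths(two_methylpentane, m) for m in (1, 2, 3)])  # [5, 5, 3]
print(kappa(two_methylpentane, 1))  # 6.0
print(kappa(two_methylpentane, 2))  # 3.2
```

The first-order value equals the atom count, as expected for any acyclic graph.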




${}^1P = 5$    ${}^1\kappa = 6.000$

Box K-1.



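To take into account the different shape contribution of heteroatoms and hybridization
states, Kier alpha-modified shape descriptors ${}^m\kappa_\alpha$ (m = 1, 2, 3) were proposed
[Kier, 1986a] by the following:

$$ {}^1\kappa_\alpha = \frac{(A+\alpha)(A+\alpha-1)^2}{({}^1P_i+\alpha)^2} \qquad {}^2\kappa_\alpha = \frac{(A+\alpha-1)(A+\alpha-2)^2}{({}^2P_i+\alpha)^2} $$

$$ {}^3\kappa_\alpha =
\begin{cases}
\dfrac{(A+\alpha-3)(A+\alpha-2)^2}{({}^3P_i+\alpha)^2} & \text{for even } A \ (A > 3) \\[2ex]
\dfrac{(A+\alpha-1)(A+\alpha-3)^2}{({}^3P_i+\alpha)^2} & \text{for odd } A \ (A > 3)
\end{cases} $$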



where α is a parameter derived from the ratio of the → covalent radius $R_i$ of the ith
atom relative to the sp³ carbon atom ($R_{Csp^3}$):

$$ \alpha = \sum_{i=1}^{A} \left( \frac{R_i}{R_{Csp^3}} - 1 \right) $$

The only non-zero contributions to α are given by heteroatoms or carbon atoms with
a valence state different from sp³ (Table K-1).
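A short sketch of the alpha modification follows. It assumes that α is the sum over atoms of (R_i / R_Csp3 - 1) and that the modified indices are obtained by replacing A with A + α and each path count with P + α. The radii used in the example (0.77 Å for sp³ carbon, 0.74 Å for sp³ oxygen) are illustrative values supplied here, not values read from Table K-1, and the function names are hypothetical.

```python
def alpha(radii, r_csp3):
    """alpha = sum over atoms of (R_i / R_Csp3 - 1)."""
    return sum(r / r_csp3 - 1.0 for r in radii)


def kappa_alpha(m, n_atoms, path_count, alpha_value):
    """Alpha-modified Kier index of order m from A, the m-th path count and alpha."""
    s = n_atoms + alpha_value
    q = path_count + alpha_value
    if m == 1:
        return s * (s - 1) ** 2 / q ** 2
    if m == 2:
        return (s - 1) * (s - 2) ** 2 / q ** 2
    if m == 3:
        if n_atoms % 2 == 0:  # parity is taken from the atom count A
            return (s - 3) * (s - 2) ** 2 / q ** 2
        return (s - 1) * (s - 3) ** 2 / q ** 2
    raise ValueError("m must be 1, 2 or 3")


# Illustrative radii (assumed): two sp3 carbons and one sp3 oxygen, C-C-O
a = alpha([0.77, 0.77, 0.74], r_csp3=0.77)
print(round(a, 2))  # -0.04
# Three atoms, two bonds: for an acyclic graph the first-order index is A + alpha
print(round(kappa_alpha(1, 3, 2, a), 2))  # 2.96
```

With α = 0 the functions reduce to the unmodified kappa indices, so sp³ carbon skeletons are unaffected.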



Table K-1. Covalent radius and α parameter values.
Kappa indices can also be calculated for molecular fragments and groups X. The
calculation of these indices for groups was performed using a "pseudo-molecule"
X-X: two fragments X of the same kind are linked together, kappa values are
calculated for the pseudo-molecule and this is then divided by two.

In order to quantify the shape of the whole molecule, Kier proposed a linear
combination of the above defined κ indices, each representing a particular shape attribute of
the molecule:

where ${}^0\kappa$ is the → Kier symmetry index used to encode the shape contributions due to
symmetry.

Combinations of κ indices were also proposed as indices of molecular flexibility
(→ Kier molecular flexibility index) and steric effects (→ Kier steric descriptor).
[Kier, 1986c] [Kier, 1987a] [Kier, 1987b] [Kier, 1987c] [Gombar and Jain, 19gl]
[SJofaiz, 1989] [Kier, 1990] [Hall and Kier, 1991] [Skvortsova et al., 199^] [Kier,
1997] [Hall and Vaughn, 1997b]
Kier symmetry index → symmetry descriptors

Kz index → Hosoya Z matrix
Appendix C. Software

Some packages explicitly related to the calculation of the molecular descriptors for
QSAR/QSPR are collected below, in alphabetic order.

General programs for computational quantum-chemistry, molecular modelling and
log P calculations are not explicitly considered in this list. An extended list of
computational chemistry programs can be found at the WebSite
http://www.netsci.org/Resources/Software/.



ADAPT
Prof. P.C. Jurs, Penn State University, University Park, PA 16802, USA
Description: A QSAR toolkit with descriptor generation (topological, geometrical,
electronic, and physicochemical descriptors), variable selection, regression and
artificial neural network modelling.
Reference: [Jurs et al., 1979]
WebSite: http://zeus.chem.psu.edu/




ASP
Oxford Molecular Ltd., Oxford Science Park, Oxford OX4 4GA, UK
Description: Calculates a quantitative measure of molecular similarity based on molecule
alignment, shape and electronic properties. Within TSAR 3D package.
WebSite: http://www.oxmol.com/




CERIUS²
Molecular Simulations Inc. - 9685 Scranton Road, San Diego, CA 92121-7357, USA
Description: C2-Descriptor+ provides a range of generic descriptors, describing topological,
electronic and structural features.
WebSite: http://www.msi.com/life/products/cerius2/modules/descriptor.html




CODESSA
Semichem Inc. - 7204 Mullen, Shawnee, KS 66216, USA
Description: Calculation of several topological, geometrical, constitutional, thermodynamic,
electrostatic, and quantum-chemical descriptors, including tools for regression
modelling and variable selection.
Reference: [Katritzky et al., 1995]
WebSite: http://www.semichem.com/




DRAGON
Prof. R. Todeschini - distributed by Talete srl, via Pisani 13, 20124 Milano, Italy
Description: Calculation of several sets of molecular descriptors from molecular geometries
(topological, geometrical, WHIM, 3D-MoRSE, molecular profiles, etc.).
WebSite: http://www.disat.unimib.it/chm/




GRIN/GRID
Molecular Discovery Ltd. - West Way House, Elms Parade, Oxford OX2 9LL, UK
Description: Calculates the GRID empirical force field at grid points. Last release: VU
Reference: [Goodford, 1985]
HQSAR
Tripos Inc. - 1699 South Hanley Rd., St. Louis, MO 63144-2913, USA
Description: A part of the SYBYL environment providing hologram descriptors.
WebSite: http://www.tripos.com/



HYBOT-PLUS
Prof. O. Raevsky - Russian Academy of Science, IPAC
Description: Calculation of hydrogen bond and free energy factors.
Reference: [Raevsky, 1997]
WebSite: http://www.ipac.ac.ru/qsar/index.htm



HYPERCHEM 6
Hypercube, Inc. - 1115 NW 4th Street, Gainesville, FL 32601, USA
Description: Calculation of optimized geometries with several computational methods.
Properties calculated include molecular volume, molar refractivity, log P,
polarizability and atomic charges. Last release: 6
WebSite: http://www.hyper.com/



MOLCONN-Z
Prof. L.H. Hall - 2 Davis Street, Quincy, MA, USA
Description: Successor of MOLCONN-X, MOLCONN-Z calculates the most well-known
topological indices, including electrotopological and orthogonalized indices.
WebSite: 1 ii^^J-// www,«ic.vabiot©ch.cofn/molcorin/ni3nu4la/310^ html






MULTICASE
Multicase Inc. - PO 22517, Beachwood, OH 44122, USA
Description: Prediction of biological activities by substructure descriptors.
Reference: [Klopman, 1992]




OASIS
Prof. O. Mekenyan - Bourgas University, 8010 Bourgas, Bulgaria
Description: Calculation of steric, electronic, and hydrophobic descriptors.
Reference: [Mekenyan et al., 1990a]
WebSite: http://omega.btu.bg/~omekenya/



PETRA
Molecular Networks GmbH - Langemarckplatz 1, D-91054 Erlangen (Germany)
Description: Empirical methods for the calculation of charges and bond energies for use in …
Reference: [Gasteiger, 1988; Löw and Saller, 19531]



POLLY
Prof. S. Basak - University of Minnesota Duluth, 5013 Miller Trunk Highway,
Duluth, MN 55811, USA
Description: Calculation of topological connectivity indices. Last release: 2.3
Reference: [Basak et al., 1988b]



SciVision - 200 Wheeler Road, Burlington, MA 01803, USA
Description: Calculation of several topological molecular descriptors (connectivity, shape,
electrotopological descriptors).
WebSite: http://www.scivision.com/



Tripos Inc. - 1699 South Hanley Rd., St. Louis, MO 63144-2913, USA
Description: SYBYL module for the calculation of EVA descriptors, CoMFA and CoMSIA
fields, also including several QSAR tools. Last release: 6.1
WebSite: http://www.tripos.com/



TSAR
Oxford Molecular Ltd., Oxford Science Park, Oxford OX4 4GA, UK
Description: Statistical and database functions with molecular and substituent property
calculations. Within TSAR 3D package.
WebSite: http://www.oxmol.com/